The Magic Of Reinforcement Learning With Human Feedback Rlhf